Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 4888 |
| Missing cells | 1012 |
| Missing cells (%) | 1.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 763.9 KiB |
| Average record size in memory | 160.0 B |
Variable types
| CAT | 9 |
|---|---|
| NUM | 8 |
| BOOL | 3 |
Designation is highly correlated with ProductPitched | High correlation |
ProductPitched is highly correlated with Designation | High correlation |
Age has 226 (4.6%) missing values | Missing |
DurationOfPitch has 251 (5.1%) missing values | Missing |
NumberOfTrips has 140 (2.9%) missing values | Missing |
NumberOfChildrenVisiting has 66 (1.4%) missing values | Missing |
MonthlyIncome has 233 (4.8%) missing values | Missing |
CustomerID has unique values | Unique |
Reproduction
| Analysis started | 2022-09-24 17:26:26.441955 |
|---|---|
| Analysis finished | 2022-09-24 17:26:34.969650 |
| Duration | 8.53 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 4888 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 202443.5 |
|---|---|
| Minimum | 200000 |
| Maximum | 204887 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 38.2 KiB |
Quantile statistics
| Minimum | 200000 |
|---|---|
| 5-th percentile | 200244.35 |
| Q1 | 201221.75 |
| median | 202443.5 |
| Q3 | 203665.25 |
| 95-th percentile | 204642.65 |
| Maximum | 204887 |
| Range | 4887 |
| Interquartile range (IQR) | 2443.5 |
Descriptive statistics
| Standard deviation | 1411.188388 |
|---|---|
| Coefficient of variation (CV) | 0.006970776479 |
| Kurtosis | -1.2 |
| Mean | 202443.5 |
| Median Absolute Deviation (MAD) | 1222 |
| Skewness | 0 |
| Sum | 989543828 |
| Variance | 1991452.667 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 200702 | 1 | < 0.1% | |
| 201479 | 1 | < 0.1% | |
| 203514 | 1 | < 0.1% | |
| 201467 | 1 | < 0.1% | |
| 203518 | 1 | < 0.1% | |
| 201471 | 1 | < 0.1% | |
| 203522 | 1 | < 0.1% | |
| 201475 | 1 | < 0.1% | |
| 203526 | 1 | < 0.1% | |
| 203530 | 1 | < 0.1% | |
| Other values (4878) | 4878 | 99.8% |
| Value | Count | Frequency (%) | |
| 200000 | 1 | < 0.1% | |
| 200001 | 1 | < 0.1% | |
| 200002 | 1 | < 0.1% | |
| 200003 | 1 | < 0.1% | |
| 200004 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 204887 | 1 | < 0.1% | |
| 204886 | 1 | < 0.1% | |
| 204885 | 1 | < 0.1% | |
| 204884 | 1 | < 0.1% | |
| 204883 | 1 | < 0.1% |
ProdTaken
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.2 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 3968 | 81.2% | |
| 1 | 920 | 18.8% |
| Distinct | 44 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 226 |
| Missing (%) | 4.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.62226512 |
|---|---|
| Minimum | 18 |
| Maximum | 61 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 38.2 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 31 |
| median | 36 |
| Q3 | 44 |
| 95-th percentile | 55 |
| Maximum | 61 |
| Range | 43 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.316387033 |
|---|---|
| Coefficient of variation (CV) | 0.2476296151 |
| Kurtosis | -0.4513319674 |
| Mean | 37.62226512 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.3829886837 |
| Sum | 175395 |
| Variance | 86.79506734 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 35 | 237 | 4.8% | |
| 36 | 231 | 4.7% | |
| 34 | 211 | 4.3% | |
| 31 | 203 | 4.2% | |
| 30 | 199 | 4.1% | |
| 32 | 197 | 4.0% | |
| 33 | 189 | 3.9% | |
| 37 | 185 | 3.8% | |
| 29 | 178 | 3.6% | |
| 38 | 176 | 3.6% | |
| Other values (34) | 2656 | 54.3% | |
| (Missing) | 226 | 4.6% |
| Value | Count | Frequency (%) | |
| 18 | 14 | 0.3% | |
| 19 | 32 | 0.7% | |
| 20 | 38 | 0.8% | |
| 21 | 41 | 0.8% | |
| 22 | 46 | 0.9% |
| Value | Count | Frequency (%) | |
| 61 | 9 | 0.2% | |
| 60 | 29 | 0.6% | |
| 59 | 44 | 0.9% | |
| 58 | 31 | 0.6% | |
| 57 | 29 | 0.6% |
TypeofContact
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 25 |
| Missing (%) | 0.5% |
| Memory size | 38.2 KiB |
| Self Enquiry | |
|---|---|
| Company Invited |
| Value | Count | Frequency (%) | |
| Self Enquiry | 3444 | 70.5% | |
| Company Invited | 1419 | 29.0% | |
| (Missing) | 25 | 0.5% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 12.82487725 |
| Min length | 3 |
CityTier
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.2 KiB |
| 1 | |
|---|---|
| 3 | |
| 2 | 198 |
| Value | Count | Frequency (%) | |
| 1 | 3190 | 65.3% | |
| 3 | 1500 | 30.7% | |
| 2 | 198 | 4.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 251 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.49083459 |
|---|---|
| Minimum | 5 |
| Maximum | 127 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 38.2 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 9 |
| median | 13 |
| Q3 | 20 |
| 95-th percentile | 32 |
| Maximum | 127 |
| Range | 122 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.519642589 |
|---|---|
| Coefficient of variation (CV) | 0.5499795727 |
| Kurtosis | 11.79749394 |
| Mean | 15.49083459 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 1.752037049 |
| Sum | 71831 |
| Variance | 72.58430985 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 9 | 483 | 9.9% | |
| 7 | 342 | 7.0% | |
| 8 | 333 | 6.8% | |
| 6 | 307 | 6.3% | |
| 16 | 274 | 5.6% | |
| 15 | 269 | 5.5% | |
| 14 | 253 | 5.2% | |
| 10 | 244 | 5.0% | |
| 13 | 223 | 4.6% | |
| 11 | 205 | 4.2% | |
| Other values (24) | 1704 | 34.9% | |
| (Missing) | 251 | 5.1% |
| Value | Count | Frequency (%) | |
| 5 | 6 | 0.1% | |
| 6 | 307 | 6.3% | |
| 7 | 342 | 7.0% | |
| 8 | 333 | 6.8% | |
| 9 | 483 | 9.9% |
| Value | Count | Frequency (%) | |
| 127 | 1 | < 0.1% | |
| 126 | 1 | < 0.1% | |
| 36 | 44 | 0.9% | |
| 35 | 66 | 1.4% | |
| 34 | 50 | 1.0% |
Occupation
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.2 KiB |
| Salaried | |
|---|---|
| Small Business | |
| Large Business | |
| Free Lancer | 2 |
| Value | Count | Frequency (%) | |
| Salaried | 2368 | 48.4% | |
| Small Business | 2084 | 42.6% | |
| Large Business | 434 | 8.9% | |
| Free Lancer | 2 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 11.09206219 |
| Min length | 8 |
Gender
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.2 KiB |
| Male | |
|---|---|
| Female | |
| Fe Male | 155 |
| Value | Count | Frequency (%) | |
| Male | 2916 | 59.7% | |
| Female | 1817 | 37.2% | |
| Fe Male | 155 | 3.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.838584288 |
| Min length | 4 |
NumberOfPersonVisiting
Real number (ℝ≥0)
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.90507365 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 38.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.724890595 |
|---|---|
| Coefficient of variation (CV) | 0.2495257203 |
| Kurtosis | -0.7774673393 |
| Mean | 2.90507365 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.02981670374 |
| Sum | 14200 |
| Variance | 0.5254663748 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3 | 2402 | 49.1% | |
| 2 | 1418 | 29.0% | |
| 4 | 1026 | 21.0% | |
| 1 | 39 | 0.8% | |
| 5 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 39 | 0.8% | |
| 2 | 1418 | 29.0% | |
| 3 | 2402 | 49.1% | |
| 4 | 1026 | 21.0% | |
| 5 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 5 | 3 | 0.1% | |
| 4 | 1026 | 21.0% | |
| 3 | 2402 | 49.1% | |
| 2 | 1418 | 29.0% | |
| 1 | 39 | 0.8% |
NumberOfFollowups
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 45 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.708445179 |
|---|---|
| Minimum | 1 |
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 38.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.002508686 |
|---|---|
| Coefficient of variation (CV) | 0.2703312677 |
| Kurtosis | 0.6203311898 |
| Mean | 3.708445179 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.3727193989 |
| Sum | 17960 |
| Variance | 1.005023666 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4 | 2068 | 42.3% | |
| 3 | 1466 | 30.0% | |
| 5 | 768 | 15.7% | |
| 2 | 229 | 4.7% | |
| 1 | 176 | 3.6% | |
| 6 | 136 | 2.8% | |
| (Missing) | 45 | 0.9% |
| Value | Count | Frequency (%) | |
| 1 | 176 | 3.6% | |
| 2 | 229 | 4.7% | |
| 3 | 1466 | 30.0% | |
| 4 | 2068 | 42.3% | |
| 5 | 768 | 15.7% |
| Value | Count | Frequency (%) | |
| 6 | 136 | 2.8% | |
| 5 | 768 | 15.7% | |
| 4 | 2068 | 42.3% | |
| 3 | 1466 | 30.0% | |
| 2 | 229 | 4.7% |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.2 KiB |
| Basic | |
|---|---|
| Deluxe | |
| Standard | |
| Super Deluxe | |
| King |
| Value | Count | Frequency (%) | |
| Basic | 1842 | 37.7% | |
| Deluxe | 1732 | 35.4% | |
| Standard | 742 | 15.2% | |
| Super Deluxe | 342 | 7.0% | |
| King | 230 | 4.7% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 6.252454992 |
| Min length | 4 |
PreferredPropertyStar
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 26 |
| Missing (%) | 0.5% |
| Memory size | 38.2 KiB |
| 3 | |
|---|---|
| 5 | |
| 4 |
| Value | Count | Frequency (%) | |
| 3 | 2993 | 61.2% | |
| 5 | 956 | 19.6% | |
| 4 | 913 | 18.7% | |
| (Missing) | 26 | 0.5% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
MaritalStatus
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.2 KiB |
| Married | |
|---|---|
| Divorced | |
| Single | |
| Unmarried |
| Value | Count | Frequency (%) | |
| Married | 2340 | 47.9% | |
| Divorced | 950 | 19.4% | |
| Single | 916 | 18.7% | |
| Unmarried | 682 | 14.0% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.286006547 |
| Min length | 6 |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 140 |
| Missing (%) | 2.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.23652064 |
|---|---|
| Minimum | 1 |
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 38.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 7 |
| Maximum | 22 |
| Range | 21 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.84901931 |
|---|---|
| Coefficient of variation (CV) | 0.5712984761 |
| Kurtosis | 6.0990233 |
| Mean | 3.23652064 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.453883784 |
| Sum | 15367 |
| Variance | 3.418872408 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 1464 | 30.0% | |
| 3 | 1079 | 22.1% | |
| 1 | 620 | 12.7% | |
| 4 | 478 | 9.8% | |
| 5 | 458 | 9.4% | |
| 6 | 322 | 6.6% | |
| 7 | 218 | 4.5% | |
| 8 | 105 | 2.1% | |
| 21 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| Other values (2) | 2 | < 0.1% | |
| (Missing) | 140 | 2.9% |
| Value | Count | Frequency (%) | |
| 1 | 620 | 12.7% | |
| 2 | 1464 | 30.0% | |
| 3 | 1079 | 22.1% | |
| 4 | 478 | 9.8% | |
| 5 | 458 | 9.4% |
| Value | Count | Frequency (%) | |
| 22 | 1 | < 0.1% | |
| 21 | 1 | < 0.1% | |
| 20 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 8 | 105 | 2.1% |
Passport
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.2 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 3466 | 70.9% | |
| 1 | 1422 | 29.1% |
PitchSatisfactionScore
Real number (ℝ≥0)
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.078150573 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 38.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.365791728 |
|---|---|
| Coefficient of variation (CV) | 0.4437053014 |
| Kurtosis | -1.102869771 |
| Mean | 3.078150573 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.1277255598 |
| Sum | 15046 |
| Variance | 1.865387043 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3 | 1478 | 30.2% | |
| 5 | 970 | 19.8% | |
| 1 | 942 | 19.3% | |
| 4 | 912 | 18.7% | |
| 2 | 586 | 12.0% |
| Value | Count | Frequency (%) | |
| 1 | 942 | 19.3% | |
| 2 | 586 | 12.0% | |
| 3 | 1478 | 30.2% | |
| 4 | 912 | 18.7% | |
| 5 | 970 | 19.8% |
| Value | Count | Frequency (%) | |
| 5 | 970 | 19.8% | |
| 4 | 912 | 18.7% | |
| 3 | 1478 | 30.2% | |
| 2 | 586 | 12.0% | |
| 1 | 942 | 19.3% |
OwnCar
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.2 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 3032 | 62.0% | |
| 0 | 1856 | 38.0% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 66 |
| Missing (%) | 1.4% |
| Memory size | 38.2 KiB |
| 1 | |
|---|---|
| 2 | |
| 0 | |
| 3 |
| Value | Count | Frequency (%) | |
| 1 | 2080 | 42.6% | |
| 2 | 1335 | 27.3% | |
| 0 | 1082 | 22.1% | |
| 3 | 325 | 6.6% | |
| (Missing) | 66 | 1.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.2 KiB |
| Executive | |
|---|---|
| Manager | |
| Senior Manager | |
| AVP | |
| VP |
| Value | Count | Frequency (%) | |
| Executive | 1842 | 37.7% | |
| Manager | 1732 | 35.4% | |
| Senior Manager | 742 | 15.2% | |
| AVP | 342 | 7.0% | |
| VP | 230 | 4.7% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 14 |
|---|---|
| Median length | 9 |
| Mean length | 8.301145663 |
| Min length | 2 |
| Distinct | 2475 |
|---|---|
| Distinct (%) | 53.2% |
| Missing | 233 |
| Missing (%) | 4.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23619.85349 |
|---|---|
| Minimum | 1000 |
| Maximum | 98678 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 38.2 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 17295.1 |
| Q1 | 20346 |
| median | 22347 |
| Q3 | 25571 |
| 95-th percentile | 34723.9 |
| Maximum | 98678 |
| Range | 97678 |
| Interquartile range (IQR) | 5225 |
Descriptive statistics
| Standard deviation | 5380.698361 |
|---|---|
| Coefficient of variation (CV) | 0.2278040532 |
| Kurtosis | 14.8440669 |
| Mean | 23619.85349 |
| Median Absolute Deviation (MAD) | 2603 |
| Skewness | 1.949159832 |
| Sum | 109950418 |
| Variance | 28951914.85 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 20855 | 7 | 0.1% | |
| 17342 | 7 | 0.1% | |
| 21288 | 7 | 0.1% | |
| 21020 | 7 | 0.1% | |
| 25482 | 6 | 0.1% | |
| 24950 | 6 | 0.1% | |
| 25025 | 6 | 0.1% | |
| 22130 | 6 | 0.1% | |
| 21419 | 6 | 0.1% | |
| 21237 | 6 | 0.1% | |
| Other values (2465) | 4591 | 93.9% | |
| (Missing) | 233 | 4.8% |
| Value | Count | Frequency (%) | |
| 1000 | 1 | < 0.1% | |
| 4678 | 1 | < 0.1% | |
| 16009 | 2 | < 0.1% | |
| 16051 | 2 | < 0.1% | |
| 16052 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 98678 | 1 | < 0.1% | |
| 95000 | 1 | < 0.1% | |
| 38677 | 2 | < 0.1% | |
| 38651 | 2 | < 0.1% | |
| 38621 | 2 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| CustomerID | ProdTaken | Age | TypeofContact | CityTier | DurationOfPitch | Occupation | Gender | NumberOfPersonVisiting | NumberOfFollowups | ProductPitched | PreferredPropertyStar | MaritalStatus | NumberOfTrips | Passport | PitchSatisfactionScore | OwnCar | NumberOfChildrenVisiting | Designation | MonthlyIncome | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 200000 | 1 | 41.0 | Self Enquiry | 3 | 6.0 | Salaried | Female | 3 | 3.0 | Deluxe | 3.0 | Single | 1.0 | 1 | 2 | 1 | 0.0 | Manager | 20993.0 |
| 1 | 200001 | 0 | 49.0 | Company Invited | 1 | 14.0 | Salaried | Male | 3 | 4.0 | Deluxe | 4.0 | Divorced | 2.0 | 0 | 3 | 1 | 2.0 | Manager | 20130.0 |
| 2 | 200002 | 1 | 37.0 | Self Enquiry | 1 | 8.0 | Free Lancer | Male | 3 | 4.0 | Basic | 3.0 | Single | 7.0 | 1 | 3 | 0 | 0.0 | Executive | 17090.0 |
| 3 | 200003 | 0 | 33.0 | Company Invited | 1 | 9.0 | Salaried | Female | 2 | 3.0 | Basic | 3.0 | Divorced | 2.0 | 1 | 5 | 1 | 1.0 | Executive | 17909.0 |
| 4 | 200004 | 0 | NaN | Self Enquiry | 1 | 8.0 | Small Business | Male | 2 | 3.0 | Basic | 4.0 | Divorced | 1.0 | 0 | 5 | 1 | 0.0 | Executive | 18468.0 |
| 5 | 200005 | 0 | 32.0 | Company Invited | 1 | 8.0 | Salaried | Male | 3 | 3.0 | Basic | 3.0 | Single | 1.0 | 0 | 5 | 1 | 1.0 | Executive | 18068.0 |
| 6 | 200006 | 0 | 59.0 | Self Enquiry | 1 | 9.0 | Small Business | Female | 2 | 2.0 | Basic | 5.0 | Divorced | 5.0 | 1 | 2 | 1 | 1.0 | Executive | 17670.0 |
| 7 | 200007 | 0 | 30.0 | Self Enquiry | 1 | 30.0 | Salaried | Male | 3 | 3.0 | Basic | 3.0 | Married | 2.0 | 0 | 2 | 0 | 1.0 | Executive | 17693.0 |
| 8 | 200008 | 0 | 38.0 | Company Invited | 1 | 29.0 | Salaried | Male | 2 | 4.0 | Standard | 3.0 | Unmarried | 1.0 | 0 | 3 | 0 | 0.0 | Senior Manager | 24526.0 |
| 9 | 200009 | 0 | 36.0 | Self Enquiry | 1 | 33.0 | Small Business | Male | 3 | 3.0 | Deluxe | 3.0 | Divorced | 7.0 | 0 | 3 | 1 | 0.0 | Manager | 20237.0 |
Last rows
| CustomerID | ProdTaken | Age | TypeofContact | CityTier | DurationOfPitch | Occupation | Gender | NumberOfPersonVisiting | NumberOfFollowups | ProductPitched | PreferredPropertyStar | MaritalStatus | NumberOfTrips | Passport | PitchSatisfactionScore | OwnCar | NumberOfChildrenVisiting | Designation | MonthlyIncome | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4878 | 204878 | 1 | 35.0 | Self Enquiry | 1 | 17.0 | Small Business | Male | 3 | 4.0 | Deluxe | 5.0 | Unmarried | 3.0 | 0 | 4 | 0 | 1.0 | Manager | 24803.0 |
| 4879 | 204879 | 1 | 26.0 | Self Enquiry | 2 | 27.0 | Small Business | Female | 4 | 4.0 | Basic | 4.0 | Married | 2.0 | 1 | 3 | 0 | 2.0 | Executive | 22347.0 |
| 4880 | 204880 | 1 | 59.0 | Self Enquiry | 1 | 28.0 | Small Business | Female | 4 | 4.0 | Deluxe | 4.0 | Married | 6.0 | 0 | 3 | 1 | 2.0 | Manager | 28686.0 |
| 4881 | 204881 | 1 | 41.0 | Self Enquiry | 2 | 25.0 | Salaried | Male | 3 | 2.0 | Basic | 5.0 | Married | 2.0 | 0 | 1 | 1 | 2.0 | Executive | 21065.0 |
| 4882 | 204882 | 1 | 37.0 | Self Enquiry | 2 | 20.0 | Salaried | Male | 3 | 5.0 | Basic | 5.0 | Married | 6.0 | 1 | 5 | 1 | 2.0 | Executive | 23317.0 |
| 4883 | 204883 | 1 | 49.0 | Self Enquiry | 3 | 9.0 | Small Business | Male | 3 | 5.0 | Deluxe | 4.0 | Unmarried | 2.0 | 1 | 1 | 1 | 1.0 | Manager | 26576.0 |
| 4884 | 204884 | 1 | 28.0 | Company Invited | 1 | 31.0 | Salaried | Male | 4 | 5.0 | Basic | 3.0 | Single | 3.0 | 1 | 3 | 1 | 2.0 | Executive | 21212.0 |
| 4885 | 204885 | 1 | 52.0 | Self Enquiry | 3 | 17.0 | Salaried | Female | 4 | 4.0 | Standard | 4.0 | Married | 7.0 | 0 | 1 | 1 | 3.0 | Senior Manager | 31820.0 |
| 4886 | 204886 | 1 | 19.0 | Self Enquiry | 3 | 16.0 | Small Business | Male | 3 | 4.0 | Basic | 3.0 | Single | 3.0 | 0 | 5 | 0 | 2.0 | Executive | 20289.0 |
| 4887 | 204887 | 1 | 36.0 | Self Enquiry | 1 | 14.0 | Salaried | Male | 4 | 4.0 | Basic | 4.0 | Unmarried | 3.0 | 1 | 3 | 1 | 2.0 | Executive | 24041.0 |